Back

Scientific Data

30 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
The Effects of External Laser Positioning Systems for MRI Simulation on Image Quality and Quantitative MRI Values
2026-03-07 radiology and imaging 10.64898/2026.03.06.26347809
#1 (5.0%)
Show abstract

Background and Purpose: Magnetic resonance imaging (MRI) for radiation therapy treatment planning is currently being used in many anatomical sites to better visualize soft tissue landmarks, a technique known as an MRI simulation. A core component of modern MRI simulation configurations are the use of external laser positioning systems (ELPS) to help set up the patient. Though necessary for accurate and reproducible patient setup, the ELPS, if left on during imaging, may interfere negatively with...

2
BUDAPEST: A Fast and Reliable Bayesian Algorithm for TMS Threshold Estimation with an Open-Source GUI and Human Validation
2026-03-04 radiology and imaging 10.64898/2026.03.03.26347528
Top 0.3% (3.0%)
Show abstract

BackgroundMotor threshold (MT) estimation is fundamental to transcranial magnetic stimulation (TMS), guiding individualized stimulation intensity in research and therapy. Conventional methods such as the 5-out-of-10 rule require many stimuli, while adaptive approaches like Parameter Estimation by Sequential Testing (PEST) improve efficiency but can exhibit poor convergence under certain conditions. ObjectiveThis study introduces the Bayesian Uncertainty Dynamic Algorithm for Parameter Estimatio...

3
DBT-2026, a de-identified publicly available dataset of digital breast tomosynthesis exams with ground truth biopsies
2026-03-04 radiology and imaging 10.64898/2026.03.03.25337924
Top 0.9% (1.9%)
Show abstract

Digital breast tomosynthesis (DBT) is a powerful imaging modality that allows for improved lesion visibility, characterization, and localization compared to conventional two-dimensional digital mammography. DBT has been increasingly adopted in screening and diagnostic settings globally, particularly for women with dense breast tissue where tissue overlap presents a significant diagnostic challenge. Here we describe DBT-2026, a real world imaging dataset with 558 DBT exams from 558 patients with ...

4
Lesion-Centric Latent Phenotypes from Segmentation Encoders for Breast Ultrasound Interpretability
2026-03-06 radiology and imaging 10.64898/2026.03.06.26347800
Top 1% (1.6%)
Show abstract

We propose a lesion-centric phenotype learning pipeline for interpretable breast ultrasound (BUS). Predicted lesion masks are used for mask-weighted pooling of segmentation-encoder latents, producing compact embeddings that suppress background influence; a lightweight calibration step improves cross-dataset consistency. We cluster embeddings to discover latent phenotypes and relate phenotype structure to morphology descriptors (compactness, boundary sharpness). On BUSI and BUS-UCLM with external...

5
NIR autofluorescence allows for pituitary gland detection during surgery: the first evidence from microscopic studies and in vivo measurements
Top 2% (1.5%)
Show abstract

A critical challenge in endocrine neurosurgery is intraoperative discrimination between normal pituitary tissue and pituitary neuroendocrine tumors (PitNETs). Suggesting the universal persistence of near-infrared autofluorescence (NIRAF) in endocrine organs and inspired by routine clinical use of NIRAF for parathyroid gland identification, we discovered that pituitary NIRAF can be employed for label-free transsphenoidal surgery guidance. Ex vivo confocal spectral imaging of 33 specimens identifi...

6
Intelligent Guidance and Diagnostic Assistance for Handheld Ultrasound: Actor-Critic Based Approach for Carotid Artery and Thyroid Examination
2026-03-04 radiology and imaging 10.64898/2026.03.02.26347395
Top 2% (1.3%)
Show abstract

Handheld ultrasound devices have revolutionized point-of-care diagnostics, but their effectiveness remains limited by operator dependency and the need for specialized training. This paper presents an intelligent guidance and diagnostic assistance system for the handheld wireless ultrasound device, enabling automated carotid artery and thyroid examinations through handheld operation. Drawing inspiration from the Actor-Critic framework, we implement a simulation-based reinforcement learning approa...

7
Thyroid Cancer Risk Prediction from Multimodal Datasets Using Large Language Model
2026-03-06 health informatics 10.64898/2026.03.05.26347766
Top 2% (1.1%)
Show abstract

Thyroid carcinoma is one of the most prevalent endocrine malignancies worldwide, and accurate preoperative differentiation between benign and malignant thyroid nodules remains clinically challenging. Diagnostic methods that medical practitioners use at present depend on their personal judgment to evaluate both imaging results and separate clinical tests, which creates inconsistency that leads to incorrect medical evaluations. The combination of radiological imaging with clinical information syst...

8
Differentiating radiation necrosis from recurrent brain metastases using magnetic resonance elastography
2026-03-06 radiology and imaging 10.64898/2026.03.04.26347674
Top 2% (1.1%)
Show abstract

Abstract Background: Conventional MRI cannot reliably distinguish radiation necrosis (RN) from recurrent metastasis after cranial radiotherapy, as both can show similar enhancement despite different biology. We tested whether these entities are mechanically non-equivalent in vivo and separable by MRE-derived viscoelastic metrics and perilesional interface-instability features. Methods: In a prospective, histopathology-anchored cohort, 11 post-radiotherapy enhancing lesions were classified as RN ...

9
BEGA-UNet: Boundary-Explicit Guided Attention U-Net with Multi-Scale Feature Aggregation for Colonoscopic Polyp Segmentation
2026-03-05 gastroenterology 10.64898/2026.03.04.26347608
Top 3% (1.0%)
Show abstract

Accurate polyp segmentation from colonoscopy images is critical for colorectal cancer prevention, yet the generalization of deep learning models under domain shift remains insufficiently explored. We propose Boundary-Explicit Guided Attention U-Net (BEGA-UNet), a boundary-aware segmentation architecture that introduces explicit edge modeling as a structural inductive bias to enhance both segmentation accuracy and cross-domain robustness. The framework integrates three components: an Edge-Guided ...

10
3D-DXA Cortical and Trabecular Parameters; Agreement and Precision Between GE Healthcare Prodigy and iDXA Densitometers
2026-03-04 radiology and imaging 10.64898/2026.03.04.26347524
Top 4% (0.8%)
Show abstract

3D-DXA reconstructs DXA hip scans to 3-dimensional images allowing measurement of trabecular and cortical bone parameters. Given the higher image quality of GE Healthcare iDXA than GE Healthcare Prodigy, it could be hypothesized that the reconstruction might differ, thereby affecting 3D-DXA results. The aim of the study was to assess agreement and precision of 3D-DXA cortical and trabecular femur parameters between Prodigy and iDXA densitometers in adult subjects. The study cohort was composed o...

11
OncoRAG: Graph-Based Retrieval Enabling Clinical Phenotyping from Oncology Notes Using Local Mid-Size Language Models
2026-03-06 oncology 10.64898/2026.03.05.26347717
Top 5% (0.7%)
Show abstract

Introduction: Manual data extraction from unstructured clinical notes is labor-intensive and impractical for large-scale clinical and research operations. Existing automated approaches typically require large language models, dedicated computational infrastructure, and/or task-specific fine-tuning that depends on curated data. The objective of this study is to enable accurate extraction with smaller locally deployed models using a disease-site specific pipeline and prompt configuration that are ...

12
Efficacy of BodyMirror Clinical MS Multimodal Game-Based Digital Therapeutic for Remote Monitoring and Neurorehabilitation in Multiple Sclerosis: Protocol for a Multisite Randomised Controlled Trial
2026-03-06 neurology 10.64898/2026.03.06.26347719
Top 5% (0.6%)
Show abstract

Multiple sclerosis (MS) is a chronic neurodegenerative disease characterised by progressive neurological disability and heterogeneous symptom trajectories. Current clinical monitoring methods, including magnetic resonance imaging (MRI) and episodic neurological assessments, provide limited insight into subtle disease progression and functional changes. Digital health technologies integrating multimodal biosignals and behavioural assessments may enable continuous monitoring and personalised rehab...

13
Show Your Work: Verbatim Evidence Requirements and Automated Assessment for Large Language Models in Biomedical Text Processing
2026-03-04 health informatics 10.64898/2026.03.03.26346690
Top 5% (0.5%)
Show abstract

PurposeLarge language models (LLMs) are used for biomedical text processing, but individual decisions are often hard to audit. We evaluated whether enforcing a mechanically checkable "show your work" quote affects accuracy, stability, and verifiability for trial eligibility-scope classification from abstracts. MethodsWe used 200 oncology randomized controlled trials (2005 - 2023) and provided models with only the title and abstract. Trials were labeled with whether they allowed for the inclusio...

14
Red-Teaming Medical AI: Systematic Adversarial Evaluation of LLM Safety Guardrails in Clinical Contexts
2026-03-05 health informatics 10.64898/2026.02.26.26347212
Top 6% (0.5%)
Show abstract

BackgroundLarge language models (LLMs) are increasingly deployed in medical contexts as patient-facing assistants, providing medication information, symptom triage, and health guidance. Understanding their robustness to adversarial inputs is critical for patient safety, as even a single safety failure can lead to adverse outcomes including severe harm or death. ObjectiveTo systematically evaluate the safety guardrails of state-of-the-art LLMs through adversarial red-teaming specifically designe...

15
Barriers and facilitators to intracerebral haemorrhage platform trial recruitment: a survey of stroke clinicians
2026-03-06 neurology 10.64898/2026.03.05.26347732
Top 6% (0.5%)
Show abstract

Background: Platform trials are an efficient trial design which enable testing of multiple interventions simultaneously. They could advance knowledge of treatments for intracerebral haemorrhage (ICH). We aimed to investigate the views of clinicians involved in stroke research on recruitment to a future platform trial for ICH. Methods: Between April and July 2025, we conducted a UK-wide online survey of clinicians actively involved in stroke research using convenience sampling through professiona...

16
A Qualitative Study of Patient and Healthcare Provider Perspectives on Mobile Health Assessments for Cervical Spondylotic Myelopathy
2026-03-05 health informatics 10.64898/2026.03.04.26347622
Top 7% (0.5%)
Show abstract

Objective: Evaluating and monitoring patients with cervical spondylotic myelopathy (CSM) remains a challenge due to limited tools for assessing objective neurological disability longitudinally and in the home environment. Given their prevalence and low cost, mobile health (mHealth), and specifically smartphone technologies offer a promising approach to fill this gap. This study explored stakeholder perspectives on the role of mHealth in CSM monitoring to inform development of a smartphone-based ...

17
Enhancing Prediabetes Diagnosis from Continuous Glucose Monitoring Data via Iterative Label Cleaning and Deep Learning
2026-03-05 health informatics 10.64898/2026.03.04.26347604
Top 7% (0.5%)
Show abstract

As of early 2026, over 115 million US adults (more than 1 in 3) have prediabetes, a condition with an annual conversion rate of 5%-10% to type 2 diabetes. Total diabetes (diagnosed and undiagnosed) affects approximately 40.1 million Americans, or 12% of the population, with roughly 1.5 million new cases diagnosed annually. Continuous Glucose Monitoring (CGM) provides real-time, 24/7 insights into glycemic variability, detecting dangerous highs, lows, and trends that HbA1c (a 3-month average) mis...

18
Evaluating a Locally Deployed 20-Billion Parameter Large Language Model for Automated Abstract Screening in Systematic Reviews
2026-03-04 health informatics 10.64898/2026.03.04.26347506
Top 8% (0.5%)
Show abstract

BackgroundSystematic reviews (SRs) are essential for evidence-based medicine but require extensive time and resources for abstract screening. Large language models (LLMs) offer potential for automating this process, yet concerns about data privacy, intellectual property protection, and reproducibility limit the use of cloud-based solutions in research settings. ObjectiveTo evaluate the performance of a locally deployed 20-billion parameter LLM for automated abstract screening in systematic revi...

19
Medical concept understanding in large language models is fragmented
2026-03-05 health informatics 10.64898/2026.03.03.26347552
Top 8% (0.5%)
Show abstract

Large language models (LLMs) perform strongly across a wide range of medical applications, yet it remains unclear whether such success reflects genuine understanding of medical concepts. We present an ontology-grounded, concept-centered evaluation of medical concept understanding in LLMs. Using 6,252 phenotype concepts from Human Phenotype Ontology, we decompose concept understanding into three core dimensions--concept identity, concept hierarchy, and concept meaning--and design corresponding be...

20
Perceptions of Artificial Intelligence in the Editorial and Peer Review Process: A Cross-Sectional Survey of Traditional, Complementary, and Integrative Medicine Journal Editors
2026-03-04 health informatics 10.64898/2026.03.04.26347571
Top 8% (0.4%)
Show abstract

BackgroundArtificial intelligence chatbots (AICs) are increasingly being integrated into scholarly publishing, with the potential to automate routine editorial tasks and streamline workflows. In traditional, complementary, and integrative medicine (TCIM) publishing, editorial and peer review processes can be particularly complex due to diverse methodologies and culturally embedded knowledge systems, presenting unique opportunities and challenges for AIC adoption. MethodsAn anonymous, online cro...